Automating Construction Work Data-Oriented Parsing and Constructivist Accounts of Language Acquisition
نویسندگان
چکیده
The constructionist approach to language has long proven its merits as a theoretical framework guiding linguistic observations. However, relatively little work has been dedicated to providing a precise, formalized definition of constructions and the mechanisms by means of which they are acquired. In giving an overview of recent work in Data-Oriented Parsing (DOP), we show how the theoretical development of construction grammar and usage-based approaches to language acquisition can benefit from the converging evidence and novel insights that computational models such as DOP can provide us with. In this chapter, we introduce DOP and compare its properties to usage-based and constructionist ideas about the nature of grammar and its acquisition. We discuss the unsupervised incarnation of DOP, U-DOP, and show how it can be used to address nativist hypotheses about the learnability of grammatical patterns. Finally, we propose an extension of the formalism that is able to learn a meaning-driven grammar from unstructured input data.
منابع مشابه
Corpus-Based Lexical Acquisition For Semantic Parsing
Building accurate and e cient natural language processing (NLP) systems is an important and di cult problem. There has been increasing interest in automating this process. The lexicon, or the mapping from words to meanings, is one component that is typically di cult to update and that changes from one domain to the next. Therefore, automating the acquisition of the lexicon is an important task ...
متن کاملAn improved joint model: POS tagging and dependency parsing
Dependency parsing is a way of syntactic parsing and a natural language that automatically analyzes the dependency structure of sentences, and the input for each sentence creates a dependency graph. Part-Of-Speech (POS) tagging is a prerequisite for dependency parsing. Generally, dependency parsers do the POS tagging task along with dependency parsing in a pipeline mode. Unfortunately, in pipel...
متن کاملRobust Sub-Sentential Alignment of Phrase-Structure Trees
Data-Oriented Translation (DOT), based on DataOriented Parsing (DOP), is a language-independent MT engine which exploits parsed, aligned bitexts to produce very high quality translations. However, data acquisition constitutes a serious bottleneck as DOT requires parsed sentences aligned at both sentential and sub-structural levels. Manual substructural alignment is time-consuming, error-prone a...
متن کاملIncorporating Cognitive Linguistic Insights into Classrooms: the Case of Iranian Learners’ Acquisition of If-Clauses
Cognitive linguistics gives the most inclusive, consistent description of how language is organized, used and learned to date. Cognitive linguistics contains a great number of concepts that are useful to second language learners. If-clauses in English, on the other hand, remain intriguing for foreign language learners to struggle with, due to their intrinsic intricacies. EFL grammar books are ...
متن کاملتأثیر ساختواژهها در تجزیه وابستگی زبان فارسی
Data-driven systems can be adapted to different languages and domains easily. Using this trend in dependency parsing was lead to introduce data-driven approaches. Existence of appreciate corpora that contain sentences and theirs associated dependency trees are the only pre-requirement in data-driven approaches. Despite obtaining high accurate results for dependency parsing task in English langu...
متن کامل